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Input file Fbh32142FL.seq; Output File 32142. trans 
Sequence length 2660 
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CCTTTNTNRCCACGCGTCCGAGAGCGCCCC6CAGTCTTCGCG6AAA6CGTTCG66GTA6GCG ATG GCT GCG ACG 12 
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LLHYHAIQAST 
CTG CTC CAC TAC CAT GCA ATC CAG GCA TCC ACC 

PMGVIGLILPP 
CCC ATG GGA GTA ATT GGC CTC ATC CTG CCA CCC 

RICPAL A.VGCT 
AGG ATT TGC CCT GCC CTG GCT GTG GGC TGC ACC 

PAPLLLAQLAG 
CCG GCG CCC CTC CTC CTG GCC CAG CTG GCG GGG 

NVVSGPASLVP 
AAT GTC GTC AGT GGC CCT GCG TCC CTG GTG CCC 

KVAFCGAPEEG 
AAG GTG GCC TTC TGC GGA GCC CCG GAG GAA GGG 

ECAEL GLALGT 
GAG TGT GCG GAG CTG GGC CTG GCG CTG GGG ACG 
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ADV DSAVEGVVDAAWSDRGP 304 

6CG GAC GTA GAC TCG GCC GTG GAG GGT GTC GTG GAC GCC GCC TGG TCC GAC CGC GGC CCG 912 

GGLRLLIQESVWDEAMRRLQ 324 

GGT GGC CTC AGG CTC CTC ATC CAG GAG TCT GTG TGG GAT GAA GCC ATG AGA CGG CTG CAG 972 

ERMGRLRSGRGLDGAVDMGA 344 

GAG CGG ATG GGG CGG CTT CGG AGT GGC CGA GGG CTG GAT GGG GCC GTG GAC ATG GGG GCC 1032 

RGAAACDLVQRFVRBAQSQG 364 

CGG GGG GCT GCC GCA TGT GAC CTG GTC CAG CGC TTT GTG CGT GAG GCC CAG AGC CAG GGT 1092 

AOVFQAGDVPSERPFYPPTL 384 

GCA CAG GTG TTC CAG GCT GGT GAT GTG CCT TCG GAA CGC CCA TTC TAT CCC CCA ACC TTG 1152 

VSNLPPASPCAQVEVPWPVV 404 

GTC TCC AAC CTG CCC CCA GCC TCC CCA TGT GCC CAG GTG GAG GTG CCG TGG CCT GTG GTC 1212 

VASPFRTAKTSALLVA. NGTPR 424 

GTG GCC TCC CCC TTC CGC ACA GCC AAG GAG GCA CTG TTG GTG GCC AAC GGG ACG CCC CGC 1272 

GGSASVWSERLGQALELGYG 444 

GGG GGC AGC GCC AGT GTG TGG AGC GAG AGG CTG GGG CAG GCG CTG GAG CTG GGC TAT GGG 1332 

LQVGTVWINAHGLRDPSVPT 464 

CTC CAG GTG GGC ACT GTC TGG ATC AAC GCC CAC GGC CTC AGA GAC CCT TCG GTG CCC ACA 1392 

GGCKESGCSWHGGPDGLYEY 484 

GGC GGC TGC AAG GAG AGT GGG TGT TCC TGG CAC GGG GGC CCA GAC GGG CTG TAT GAG TAT 1452 

LRPSGTPARLS CLSKNLNYD 504 

CTG CGG CCC TCA GGG ACC CCT GCC CGG CTG TCC TGC. CTC TCC AAG AAC CTG AAC TAT GAC 1512 

TPGLAVPSTLPAGPEIGPSP 524 

ACC TTT GGC CTC GCT GTG CCC TCA ACC CTG CCG GCT GGG CCT GAA ATA GGG CCC AGC CCA 1572 

APPYGLFVGGRFQAPGARSS 544 

GCA CCC CCC TAT GGG CTC TTC GTT GGG GGC CGT TTC CAG GCT CCT GGG GCC CGA AGC TCC 1632 

RPIRDSSGNLHGYVAEGGAK 564 

AGG CCC ATC CGG GAT TCG TCT GGC AAT CTC CAT GGC TAC GTG GCT GAG GGT GGA GCC AAG 1692 

DIRGAVEAAHQAFPGWAGQS 584 

GAC ATC CGA GGT GCT GTG GAG GCC GCT CAC CAG GCT TTC CCT GGC TGG GCG GGC CAG TCC 1752 

PGARAALLWALAAALERRKS 604 

CCA GGA GCC CGG GCA GCC CTG CTG TGG GCC CTG GCG GCT GCA CTG GAG CGC CGG AAG TCT 1812 
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_ T 1CRT.ERQGAELKAAEAEV 624 

ACC CTG GCC 4 AGG CTG GAG AGG CAG GGA GCG GAG CTC AAG GCT GCG GAG GCG GAG GTG 1872 

GAG CTG AGC GCA AGA CGA CTT CGG GCG TGG GGG GCC CGG GTG CAG GCC CAA GGC CAC ACC 1932 

r nifiPLRGPVLRLREPLGVL 664 

CTG CAG GTA Gc"c G^G CTG AGA GGC CCT GTG CTG CGC CTG CGG GAG CCG CTG GGT GTG CTG 1992 

GCT GT~G GTG TGT CCG GAC GAG TGG CCC CTG CTT GCC /tC G^G TCC CTG CTG GCT CCC GCC 2052 

r lV rNTVVMVPSAACPLLAL 704 

CTG GCC TAC GGC A^C ACT GTG GTC ATG GTG CCC AGT GCG GCC TGT CCT CTG CTG GCC CTG 2112 

„t, rnnMATVFPAGLANVVTG 724 

GAG GTC TGC CAG GAC A^G GCC ACC GTG TTC CCA GCA GGC CTG GCC AAC GTG GTG ACA GGA 2172 

_ nHT trCLALHQDVQAMWY 744 

GAC CGG GAC CAT CTG ACC CGC TGC CTG GCC TTG CAC CAA GAC GTC CAG GCC ATG TGG TAT 2232 

F G .. S A Q G S Q F V E W A S A G N L _K_ _P_ m 



T?C GGA TCA GCC cic GGT TCC C^G TTT GTC GAG TGG GCC TCG GCA GGA AAC CTC AAA CCG 2292 
-j SQ n(ippRAWDQEAEGAGP 784 
GW tL GCG AGC AGG GGC TGC CCG CGG GCC TGG GAC CAG GAG GCC GAG GGG GCA GGC CCA 2352 

GAG CTG GGG CTG CGA G^G GCG CGG ACC AAG G^C CTG TGG CTG CCT A^G GGG GAC TGA 2409 
TGCCTGAGCGCCACCTACTGCATTTTGGACACCTCACACCAAGGGGAGATGCACCCCACAGACACCTGGGACTTTCCCC 

TTCTGGTTCCTGTGTCTCCCAATAAACTCTCTGACCAACCCTAAAAAAAAAAAAAAAAAAAAAAAAAARWARMAACTTC 
TGGCAGATATGAGGCTTTTTTCTTTTTTTTT 

Fig. 1C 
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MAATRAGPRAREIFTSLEYGPVPESHACALAWLDTQDRCLGHYVNGKWLKPEHRNSVPCQ 
DPITGENLASCLQAQAEDVAAAVEAARMAFKGWSAHPGWRAQHLTRLAEVIQKHQRLLW 

TLE S LVTGRAVRE VRDGDVQL AQQ L LH YHAI Q AS TQE E AL AGWE PMGV I GL I f f c n 
FMMWRTCPALAVGCTWALVPPASPAPLLLAQLAGELGPFPGILNWSGPASLVPILASQ 

DRGPGGLRLLIQESVWDEAMRRLQERMGRLRSGRGLDGAVDMGARGAAACDLVQRFVREA 
OSQGAQVFQAGDVPSERPFYPPTLVSNLPPASPCAQVEVPWPVWASPFRTAKEALLVAN 
GTPRGGSASVWSERLGQALELGYGLQVGTVWINAHGLRDPSVPTGGCKESGCSWHGGPDG 

ARSSRPIRDSSGNLHGYVAEGGAKDIRGAVEAAHQAFPGWAGQSPGARAALLWALAAALE 
RRKSTLASRLERQGAELKAAEAEVELSARRLRAWGARVQAQGHTLQVAGLRGPVLRLREP 

lgvlIwcpdewpllafvsliapalaygntw^ 

WTGDRDHLTRCLALHQDVQAMWYFGSAQGSQFVEWASAGNLKPVWASRGCPRAWDQEAE 
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Fig. 3 
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Protein Family / Domain Matches, HMMer Version 2 

Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) 



HMM file: 
Sequence file: 



/prod/ddm/seqanal/PFAM/pf am5 . 0/Pf am 
/prod/ddm/wspace/orfanal/oa-script . 9519 . seq 



Query: 32142 

Scores for sequence family classification (score includes all domains) : 
Model Description Score E-value N 



aldedh Aldehyde dehydrogenase family 



Parsed for domains: 

Model Domain seq-f,seq-t 



aldedh 



1/1 



47 494 



hmm-f hmm-t 
"""""I ~~492 [] 



score 
149~8 



149.8 

E-value 
4"7e-41 



4.7e-41 



Alignments of top-scoring domains: 

aldedh: domain 1 of 1, from 47 to 494: score 149.8, E = 4.7e-41 

* - >ewvdsasgkt f evvNPankgevigrvpeataeDvdaAVkAAkeAf ks 
+w +++ + +++ +P + ge +++ +a+aeDv aAV AA+ Afk+ 
32142 47 KWLWPEHRNS VPCQDPIT - GENLASCLQAQAEDVAAAVEAARMAFKG 92 

GpwWakvpaseRariLrkladlieeredeLaaletlDlGKplaeAkgDte 
W++ p Ra+ L +la+ i+ ++ +L le+1 +G ++e+ + + 
32142 93 WS AHPGWRAQHLTRLAEVIQKHQRLLWTLESLVTGRAVREVRDG - D 138 

vgraideiryyagwarklmgerrvipslatdgdeelnytrrePlGVvgvI 
v+ a + ++y a +a+ t+ e ++ +eP GV+g I 
32142 139 VQLAQQ LLHYHAI QAS TQ EEALAGWEPMGVIGL I 172 

sPWNFPlllalwklapALAaGNTWIKPSEqTPlt. . alllaelieeaGa 
P F +1 ++w ++pALA G+TW + P+++ 111a 1 e G 
32142 172 LPPTFSFLEMMWRICPALAVGCTW ALVPPASpaPLLLAQLAGELG- 218 

nnlPkGVvnwpGfGaevGqaLlshpdidkisFTGSteVGklimeaAAak 
+G +nw G +a+ + L+s+p+i+k++F G +e G+ + ++ A + 
32142 219 - - PFPGILNWSG - PASLVPILASQPGIRKVAFCGAPEEGRALRRSLAGE 265 

nlkkVtLELGGKsPvIVfdDADLdkAverivfgaFgnaGQvCiApsRllv 
+ L LG s d AD d Ave++v +a G ++ R11+ 
32142 266 - CAELGLALGTESLLLLTDTADVDSAVEGWDAAWSDRG PGGLRLLI 311 



hesiydeFveklkervkklkliGdpldsdtniyGPHseqqfdrvlsyle 
+es+ de + +l+er+ +1+ G +ld + + G+ +++ d v +++ 
32142 312 QESVWDEAMRRLQERMGRLR - SGRGLDGAVDM - GAR - GAAACDLVQRFVR 358 

dgkeeGAkvlcGGerdeskeylggGyyvqPTif tdVtpdMklmkEEIFGP 
+++++GA+V + G ++ + + ++ PT+++++ p +++++ E+ P 
32142 359 EAQSQGAQVFQAGDVPSE RP FYPPTLVSNLPPASPCAQVEVPWP 402 

VlpiikfkdldEAIelaNdteYGLAayvFTkdilarafrvakaleaGiVw 
V++ f++ EA+ aN t+ G +a+v+++ 1 a +1++G+W 
32142 403 VWASPFRTAKEALLVANGTPRGGSASVWSER - LGQALELGYGLQVGTVW 451 



vNDvcvhaaepqlPFGGvHqSSGiGrehgGkygleeYteiKtVt irl< - * 
+N ++ +p++P GG K+ SG + ++ G++gl eY++ + rl 
32142 452 IN- - AHGLRD PS VPTGGCKE - SGCSWHG - GPDGLYE YLRPSGTPARL 



494 



Fig. 4 
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ProDom Matches 



Prodomld 



Start 
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Description 



Score 



View Prodom 135 Boxer 



1 Showing match HlGolj 



101 



770 



p99.2 (229) DHAL(IO) DHAB(IO) DHAM(7)// 
DEHYDROGENASE OXIDOREDUCTASE ALDEHYDE 
NAD PROTEIN CLASS SEMIALDEHYDE PRECURSOR 
TRANSIT PEPTIDE 



280 



Prodomld 



Start 



End 



Description 



Score 



View Prodom 135 | Boxer w\ [Showing match |t| (Goll 



>135 p99.2 (229) DHAL(IO) DHAB (10) DHAM(7) // DEHYDROGENASE OXIDOREDUCTASE 
ALDEHYDE NAD PROTEIN CLASS SEMIALDEHYDE PRECURSOR TRANSIT PEPTIDE 
Length =494 

Score = 280 (103.6 bits), Expect = 7.8e-22, P = 7.8e-22 
Identities = 87/289 (30%), Positives = 142/289 (49%) 

Query: 216 ELGPFPGILNWSG- - PASLVPILASQPGIRKVAFCGAPEEGRALRRSXXXXXXXXXXXX 273 

E G PG++NW+G A + L S P I K++F G-f E G+A+ ++ 
Sbjct: 194 EAGLPPGVINVVTGFGGAEVGEALVSHPDIDKISFTGSTEVGKAIMKAAAEKNLKPVT 253 

Query: 274 XXXXX - -XXXXDTADVDSAVEGWDAAWSDRGP GGLRLLIQESVWDEAMRRLQERMG 328 

D D+D AVE W A+ + G R+ +QES++DE + +i ER+ 

Sbjct: 254 LGGKNPVIVFEDADDLDKAVESWFGAFFNSGQVCTAASRIFVQESIYDEFVEKLVERVK 313 

Query: 329 RL - RSGRG- - LDGAVDMGAR- GAAACDLVQRFVREAQSQGAQVFQAGD VPSERPFY- 380 

+L + G LD DMG + +Q ++ EA+++GA++ G+ E ++ 

Sbjct: 314 KLLKVGEDDPLDPDTDMGPLINEEQYEKIQSYIEEAKAEGAKLVCGGERRKAGDEGGYFI 373 

Query: 381 PPTLVSNLPPASPCAQVEVPWPVVVASPFRT-AKEALLVANGTPRGGSASVWSERLGOAL 439 

PT+++++ Q E+ PV+ F+ EA+ +AN T G +A V++ + ~+A 

Sbjct: 374 QPT I LTD VTEDMRIMQEEI FGP VLP VIKFKDDLDEAI ELANDTE YGLAAGVFTRD I ERAQ 433 

Query: 440 ELGYGLQVGTVWINA HGLRDPSVPTGGCKESGCSWH-GGPDGLYEY 484 

+ L+ GTVW+N H + P GG K+SG GG GL EY 
Sbjct: 434 RVAERLEAGTVWVNDNIYHVSAEAQAPFGGYKQSGIGGREGGKYGLEEY 482 

Score = 262 (97.3 bits), Expect = 8.2e-20, P = 8.2e-20 
Identities = 86/301 (28%), Positives = 140/301 (46%) 
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Sbjct: 


61 


Query: 


151 


Sbjct: 


121 


Query: 


203 


Sbjct: 


181 


Query: 


261 


Sbjct: 


241 


Query: 


316 


Sbjct: 


301 


Query: 


372 


Sbjct: 


361 



RA+ L +LA+++++++ L LE+L TG+ + E + +V A L Y+A 



I S E + EP+GV+ I P F + +W+I PALA G T 



150 



E G PG++NW+G A + L S P I K++F G+ E G+A+ + 



SXXXXXXXXXXXXXXXXX- 
+ 



- XXXXDTADVDSAVEGWDAAWSDPGP - - -GGLRLLIQESV 315 
D D+D AVE W A+ + G R+ +QES+ 



+DE + +L ER+ +L + G 



LD DMG 



Fig. 5A 
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Score = 219 (82.2 bits). Expect = 4.9e-15, P = 4.9e-15 
Identities = 75/236 (31%) , Positives = 105/236 (44%) 
Query: 550 SSGNLHGYVAEGWAKDIRGAVEAMIQAFPG- -WAGQSP-GXXXXXXXXXXXXXERRKSTL 606 

Sbjct: 20 SgEvIAQ^TKE?™^^ 79 

Query: 607 M--MJWm^ 659 
80 A^ETMTGKPLAEAKVAEVAIUWDYLRYyAGMAEKIi^EETIPTSLSESPGSMSYTMRE 139 



OU AtVUCiXAJiaxvji.^ 

660 PI^VVCTDKIIPIIAFV^ 716 

mo SS&ii!^^ 199 

717 GLANWTG - DRDHLTRCLALHQDVQAMWYFGS AQ - Q^^^^^^^^p^^^Q 770 
200 ^GFGGAEVGEA^ 255 

fig. 5B 
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Input file Fbh21481FL.seq; Output File 21481. trans 
Sequence length 1379 

TTTGGCCCTCGAGGCCMGAATTCGGCACGAGGAGCAAGTGGCCTTAACACATGGATTTTCTTCCAAAAATGCAGACCC 
ATTTTAATTAAGTTTOTAATTA^ 

ACTTCCTATTTGTTGAACATTTGGGGATTAGCACGCCCACTGGGTGTTCAGCTTGGAGGCTTGCACAGAGCTGAGCTCC 

CTGCAGCCTTGGGCCTCCCCCTGCCCTGGGAGTCCTGATCAGCGTCTCTTTGCAAAGCCAATCCCCTTTTACTCCGTTG 

MGVMAMLMLPLLLLGI 16 

TCCCCCAGAACAAG ATG GGA GTC ATG GCC ATG CTG ATG CTC CCC CTG CTG CTG CTG GGA ATC 48 

n n ttutyOEVSRL-WSKSAVQ 36 

AGO GGC CTC CTC TTC ATT TAC CAA GAG GTG TCC AGG CTG TGG TCA AAG TCA GCT GTG CAG 108 

uuvTTDAlSGLGKECARV 56 

|L AAA m GTG GTG ATC ACC GAT GCC ATC TCA GGA CTG GGC AAG GAG TGT GCT CGG GTG 168 

TTC CAC ACA GGT GGG GCA AGG CTG G^G CTG TGT GGA AAG lL TGG GAG AGG CTA GAG AAC 228 

CTA TAT GAT GCC TTG ATC AGC GTG GCT GAC CCC AGC AAG ACA TTC ACC CCA AAG CTG G^C 288 
T r n T cnlSCVPDVAKEVLDCll6 
CTG WG GAC CTC TCA GAC ATC AGC TGT GTC CCA GAT GTG GCA AAA GAA GTC CTG GAT TGC 348 

/AT GGC TGT GTG GAC ATC CTC ATC AAC AAT GCC AGT g5Eg AAG GTG AAG G^G CCT GCC CAT 408 

nTCKIMDANYFGPIT 156 

AAG ATT TCT c'tG GAG CTC GAC AAA AAG ATC ATG GAT GCC AAT TAC TTT GGC CCC ATC ACA 468 

tpnmISRRTGQIVLV 176 

TTG ACG AAA GCC CTG CTT CCC AAC ATG ATC TCC CGG AGA ACA GGC CAA ATC GTG TTA GTG 528 

„ v t n C K F G I P F R T T Y A A S K H 196 
MT MT ATC cL GGG AAG TTT GGA ATC CCG TTC CGT ACG ACT TAC GCT GCC TCC AAG CAC 588 



Fig. 6A 
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AALGFFDCLRAEVEEYDVVI216 
GCA GCC CTG GGC TTC TTT GAC TGC CTC CGA GCC GAA GTG GAG GAA TAC GAT GTT GTC ATC 648 

ST VSPTFIRSYHVYPEQGNW236 
AGC ACC GTG AGC CCG ACT TTC ATC CGG TCG TAC CAC GTG TAT CCA GAG CAA GGA AAC TGG 708 

BASIWKFFFRKLTYGVHPVE256 
GAA GCT TCC ATT TGG AAA TTC TTT TTC AGG AAG CTG ACC TAC GGC GTG CAC CCA GTA GAG 768 

VAEEVMRTVRRKKQEVFMAN276 
GTG GCG GAG GAG GTG ATG CGC ACC GTG CGG AGG AAG AAG CAA GAG GTG TTT ATG GCC AAC 828 

PTPKAAVYVRTFFPEFFFAV296 
CCC ATC CCC AAG GCC GCC GTG TAC GTC CGC ACC TTC TTC CCG GAG TTC TTT TTC GCC GTG 888 

VACGVKEKLNVPEEG* 312 
GTG GCC TGT GGG GTG AAG GAG AAG CTC AAT GTC CCG GAG GAG GGG TAA 9ib 

CTGCAGGAGGCCAAATGGGC(]ACCCCTTGGAAATAAAGGTTTTTCTGGCAAAAAAAAAAAAAAAAAAAAANTTTGCGGC 
CGCAAGCTTATTCCCTTTAGGGAGGGTTAATTTT 

Fig. 6B 
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Signal Peptide Predictions for 21481 
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Score 


Mat@ 
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19 



-terminal 70aa used for signal peptide prediction 



Transmembrane Segments Predicted by MEMSAT 
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Protein Family / Domain Matches, HMMer Version 2 
Searching for complete domains in PFAM 
hmmpfam - search a single seq against HMM database 
HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) . 

HMM file : /prod/ddm/seqanal/PFAM/pf am5 . 0/Pf am 

Sequence file : /prod/ddm/wspace/orf anal/oa- script . 9650 . seq 

Query: 21481 

Scores for sequence family classification (score includes all domains) : 
Model Description ~ Score E-value N 



adh_short short chain dehydrogenase 120.0 4.5e-32 1 

A2M Alpha-2-macroglobulin family 0.5 7.1 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



adh short 1/1 38 227 .. -1 203- []- 120.0 4.5e-32 — 

A2M" 1/1 278 291 .. 1 14 [. 0.5 7.1 

Alignments of top-scoring domains: 

adh short: domain 1 of 1, from 38 to 227: score 120.0, E = 4.5e-32 

* - >KvaLvTGassGIGlaiAkrLakeGakVwadrneeklekGavakelk 
Kv+++T a SG+G+++A+ +++ Ga++v+++ n e+le+ ++1 
21481 38 KVWITDAISGLGKECARVFHTGGARLVLCGKNWERLEN--LYDALI 82_ 

elGgnd . . kdralaiqlDvtdeesv . aaveqaverlGrlDvLVNNAGgii ~~ " 
+++++ + lD++d + V+++++++++ +G +D+L+NNA + — 

21481 83 SV- ADPskTFTPKLVLLDLSDISCVpDVAKEVLDCYGCVDILINNAS - - V 129 

llrpgpfaelsrtmeedwdrvidvNltgvflltravlplmamkkrggGrl 
gp ++++s +e + ++++d N++g++ lt+a+lp m+ r+ G I 
21481 130 -KVKGPAHKIS LELDKKIMDANYFGPITLTKALLP--NMISRRTGQI 173 

vNiSSvaGrkegglvgvpggsaYsASKaAvigltrsLAlElaphglrVna 
v + + G + g p+++ Y+ASK+A g+ ++L+ E+ ++ + ++ 
21481 174 VLVNNIQG KFGIPFRTTYAASKHAALGFFDCLRAEVEEYDWIST 218 

VAPGgvdTd<-* 
v+P +++ 
21481 219 VSPTFIRSY 227 

A2M: domain 1 of 1, from 278 to 291: score 0.5, E = 7.1 

*->idedditiRSyFPE<-* 
i+ + +R++FPE 
21481 278 I PKAAVYVRTFFPE 291 



Fig. 9 
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ProDom Matches 



Prodomld 



View Prodom 1 1 1 Boxer 



Showing match 



Go!] 



Prodomld 



Start 



99 



Start 



End 



219 



End 



Description 



p99.2 (1078) ADH(34) GALE(20) FABG(13)// 
OXIDOREDUCTASE PROTEIN DEHYDROGENASE 
NAD REDUCTASE NADP BIOSYNTHESIS 
SYNTHASE ALCOHOL PUTATIVE _ 



Description 



Score 



113 



Score 



View Prodom 11 I Boxer H 1 Showing match \*\ \Gol 



>11 P99.2 (1078) ADH(34) GALE (20) FABG (13 ) // OXIDOREDUCTASE PROTEIN 

DEHYDROGENASE NAD REDUCTASE NADP BIOSYNTHESIS SYNTHASE ALCOHOL PUTATIVE 
Length =269 

Score = 113 (44,8 bits), Expect = 0.00016, P = 0.00016 
Identities = 41/138 (29%), Positives = 63/138 (45%) 



Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 



99 DLSDIS-CVPDVAKEVLDCYGCVDILINNASVKV-KGPAHKISLELD KKIMDANY 151 

D+ D+ V V +E +G +D+L+NNA V K A ++ E +++++ N 

87 DVEDVEKLVETWEEF SGIHGKI DVLVNNAGVMAPKAVAE SMTEETSDDEEWEEVI EVNV 146 

152 FGPITLTKALLPNMIS RRTGQIVLVNNIQGK - FGIP - FRTTYAASKHAALGF 201 

G LT+A LP M R G IV V ++ G G P + Y+ASK A F 

147 TGTFNLTQAALPAMKKFSDAAAKKRFVGTIVNVASVAGSTMGSPGSQAAYSASKAAVESF 206 

202 FDCLRAEVEEYDWI STV 219 

L E+ Y ++ V 
207 TKSLAMELS P YS AS VAMV 224 



Fig. 10 
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Input file Fbh25964Fl.seq? Output File 25964. trans 
Sequence length 1725 

GAGMGGAGGAGCCAGCGGAAGGACGGTGTGCG^^ 

TGGGCACCCTGGGATAGCGGCTGCAGCCATCAGCAGGGGCAGACGGCAGGTGGCCTGGTTGCTGCAGCTCCCAGGATCA 

GCTCTGCCCTCCCCGCAAACGCCAGCCTCGTCACCGCTCCAGGGCACCK^^ 

MADSAQAQK 9 
CAGCCAGCCCCTGGATGAGCCAAGGTCTCTTCCCCAGCCAGGC ATG GCC GAC TCT GCA CAG GCC CAG AAG 27 

ruvTVTGGCGFLGEHVVRML 29 
CTG GTG TAC CTG GTC ACA GGG GGC TGT GGC TTC CTG GGA GAG CAC GTG GTG CGA ATG CTG 87 



CTG CAG CGG GAG CCC CGG CTC GGG GAG CTG CGG GTC TTC GAC CAA CAC CTG GGT CCC TGG 



VFDQHLGPW 49 



CTG GAG GAG CTG AAG ACA G^G CCT gIg AGG GTG ACT GCC ATC do GGG GAC GTG ACC CAG 

. rrvAAAVAGAHVVIHTAGL 89 

GCC <£t GAG GTG GCA GCA GCT GTG GCC GGA GCC CAT GTG GTC ATC CAC ACQ OCT GGG CTG « > 

„nv*«RASPKTIHEVNVQGT 109 
QTA GAC G^G TTT GGC AGG GCC AGT CCC AAG ACC ATC CAT GAG GTC AAC GTG CAG GGT ACC 

p M v I E A C V Q T G T R F L V Y T S S 129 

CGG AAC GTG ATC GAG GCT TGT GTG CAG ACC GGA ACA CGG TTC CTG GTC TAC ACC AGC AGC 387 

AW GAA GTC G^G GGG CCT MC ACC aL G^T CAC CCC TTC TAC AGG GGC A^C gL GAC ACC 447 

nvuaVHRHPYPCSKALAEWL 169 

CCA tL GAA GCA GTG CAC AGG CAC CCC TAT CCT TGC AGC AAG GCC CTG GCC GAG TGG CTG 507 

T oiNGRKVRGGLPLVTCAL 189 

GTC CTG GAG GCC AAC GGG AGG AAG GTC CGT GGG GGG CTG CCC CTG GTG ACG TGT GCC CTT 567 
p p T G I Y G E G H Q I M R D F Y R Q G 209 
CGT CCC ACG GGC ATC TAC GGT GAA GGC CAC CAG ATC ATG AGG GAC TTC TAC CGC CAG GGC 627 

CTG CGC CTG GGA GCT 4 CTC TTC CGG GCC ATC CCG GCC TCT GTG GAG it GGC CGG GTC *87 



147 



VTAIQGDVTQ 69 



207 

89 
267 

109 
327 

129 
387 

149 
447 

169 
507 

189 
567 

209 
627 

229 
687 
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YVGNVAWMHVLAARELEQRA 249 
TAT GTG GGC AAT GTT GCC TGG ATG CAC GTG CTG GCA GCC CGG GAG CTG GAG CAG CGG GCA 747 

1TMGG0VYFC YDGSPYRSYE 269 
GCC CTG ATG GGC GGC CAG GTA TAC TTC TGC TAC GAT GGA TCA CCC TAC AGG AGC TAC GAG 807 

nPNMEFLGPCGLRLVGARPL 289 
GAT TTC AAC ATG GAG TTC CTG GGC CCC TGC GGA CTG CGG CTG GTG GGC GCC CGC CCA TTG 867 

TpvWLLVFLAALNALLQWLL 309 
CTG CCC TAC TGG CTG CTG GTG TTC CTG GCT GCC CTC AAT GCC CTG CTG CAG TGG CTG CTG 927 

PPLVLYAPLLNPYTLAVANT 329 
CGG CCA CTG GTG CTC TAC GCA CCC CTG CTG AAC CCC TAC ACG CTG GCC GTG GCC AAC ACC 987 

TFTVSTDKAQRHFGYEPLFS 349 
ACC TTC ACC GTC AGC ACC GAC AAG GCT CAG CGC CAT TTC GGC TAT GAG CCC CTG TTC TCG 1047 

MtjncRT RTILWVQAATGSAQ 369 
TGG GAG GAT AGC CGG ACC CGC ACC ATT CTC TGG GTA CAG GCC GCT ACG GGT TCA GCC CAG 1107 

' 370 

* 1110 

TGA 

C(K3T(^TGGGGCC^^ 

CAGCTGCATTCCAGAGCAGGAGGCAGGGCTCTGGGGCCAGAATGGCTGTCCTTGTCGTAGAGCCCTCCACATTTTCTTT 
TTCTTTTTTGAGACAGGGTCTTGCTCTGTCACCCAGACTGGAATGCAAGTGGTGTGANTCATAAGCTCACTNGHACCCT 
YAMCCTTCTGGGTTC^GCAATCCTTNCTNGGCTYAMCCTTCTNGAACAAGCTTGGGANCCACAGGTGCACGCCANC 

CACANCCTGGCTTTTTTTT 
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Transmembrane Segments Predicted by MEMSAT 
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Protein Family / Domain Matches, HMMer Version 2 

Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

bright' W) (D 199"l998 Washington University School of Medicine 
Sr is freely distributed under the GNU General Public_License_ (GPL) 

/prod/ddm/seqanal/PFAM/pf am5 . O/Pf am 
/prod/ddm/wspace/orfanal/oa- script. 92 89. seq 



HMM file: 
Sequence file: 



Query: 25964 

Scores for sequence family classification (score includes all domains)^ ^ 
Model Description ° 

3-beta hydroxysteroid dehydrogenase/ i so 676.9 

S-adenosylmethionine synthetase 1.8 

short chain dehydrogenase -«.6 

NAD dependent epimerase/dehydratase fam -148.0 



3Beta HSD 
S-AdoMet synt 
adh short 
Epimerase 



le-199 1 

0.78 1 

0.022 1 

0.0016 1 



Parsed for domains: 

Model Domain "seq-f seq-t 



hmm-f hmm-t 



adhshort 
S-AdoMet_synt 
3Beta_HSD 
Epimerase 



1/1 
1/1 
1/1 
1/1 



10 


197 .. 


1 


203 


[] 


-48.6 


341 


351 .. 


365 


376 


.1 


1.8 


1 


365 [. 


1 


425 


[1 


676.9 


12 


365 .. 


1 


359 


[1 


-148.0 



score E-value 

0.022 
0.78 
le-199 
0.0016 



Alionments of top-scoring domains: . .„ 

v LvTG+++ +G +++ L+ + ' ++ ++ + G +++elk 
LVYLVTGGCGFLGEHWRMLLQR - - EPRLGELRVFDQHLGPWLEELK 54 

elGgndkdralaiqlDvtdeesv . aaveqaverlGrlDvLVNNAGgiill 
+ r+ aiq+Dvt++ +v aav+ a +v++ AG + 

55 TG pv RVTAIQGDVTQAHEVaAAVAGA HWIHTAG- - L- - 89 

rpgpfaelsrtmeedwdrvidvNltgvflltravlplmamkkrggGrlvN 
+ f + s t+ +++vN+ g tr v++ a ++ g v 
90 -VDVFGRAS PK TIHEVNVQG TRNVIE- -ACVQTGTRFLVY 126 



25964 10 



25964 



25964 



25964 



iSSvaGrke 9 • glvgvpggsaYsASKaAvigl trs 

+SS +e ++++++++ +++ + + ++ +Y +SKa }** 1C0 
127 TSS MEwgpntkghpf yrgnEdTPYEAVHRHPYPCSKA LAEW 168 



LAlElaphglr VnavaPGgvdTd< - * 

L IE +++r++ + a P g++ + 
25964 169 LVLEANGRKVRgg lpl vTCALRPTGIYGE 197 

S-AdoMet synt: domain 1 of 1, from 341 to 351: score 1.8, E = 0.78 

* - >HFGreevdFpWE< - * 

HFG e F+WE CSq -(A A 

25964 341 HFGYEP-LFSWE 351 1 'O' ™ 
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3Beta HSD: domain 1 of 1, from 1 to 365: score 676.9, E = le-199 

- *. >e lsesldmaglsclVTGGgGFlGrhIVreLlregeslqevRvfDlrf 

l++lVTGG+GFlG+h Vr+Ll+++++1 e+Rvf D + 



25964 



25964 



+++S++ l++lVTGG+GFlG+h Vr+Ll+++++1 e+RvfD + 
-MADS AQAQKLVYLVTGGCGFLGEHVVRMLLQREPRLGELRVFDQHL 4 6 

spelde . dssklqvitkikyieGDvtDkqdlaaAlqgiSCCTLLDmTLmD 
+p+l+e +++++ v+ +i+GDvt+++++aaA++g+ 
47 GPWLEE1KTGPVRVT AIQGDVTQAHEVAAAVAGA 80 

dwIHtAaiiDvf GelrvsGSDLSFGVTVLFLAVTEGSYWFYmGATDLR 
+wIHtA+++DvfG 

25964 81 HWIHTAGLVDVFG - 94 

kasrdrimkVNVkGTqnvldACveaGVrvlVYTSSmeWGpNsrGqpivN 
as+ +i++VNV+GT+nv++ACv++G+r+lVYTSSmeWGpN +G+p+++ 
25964 95 RAS PKT IHEVNVQGTRNVI E ACV QTGTRFLVYT S SME WGPNTKGHP F YR 144 

GdEttpYestDDhqdaYpeSKalAEklVLkANGsmlknGgrLyTCALRPa 
G+E+tpYe++ h+++Yp+SKalAE 1VL+ANG+ +++G L+TCALRP+ 
25964 145 GNEDT P YE AV - - HRHP Y PCS KAL AEWLVLE ANGRKVRGGL PLVTC ALRPT 192 

gl f GeGdqf 1 vp f lrql vknGl akf r iGdknalsdrVYVgNVAwAHILAA 
gl+GeG q + +f+rq +++G+ + fr ++ + rVYVgNC Aw+ H + LAA 
25964 193 GIYGEGHQIMRDFYRQGLRLGGWLFRAIPASVEHGRVYVGNVAWMHVLAA 242 

raLqdpkkGREGassiaGqaYFIsDdsPvnSYddFnrtllkalGlrlpst 
r+L+++ a+ + Gq+YF++D+sP++SY+dFn+++l ++Glrl + 

25964 243 RELEQR aalMGGQVYFCYDGSPYRSYEDFNMEFLGPCGLRLVGA 286 

w rlPlpllyvlaylnellswLLrklalrYtPllnpytvtlanttFtfst 
++lP++ll++la+ln+ll+wLLr+l + Y Pllnpyt+++anttFt+st 
25964 287 RpLLPYWLLVFLAALNALLQWLLRPL-VLYAPLLNPYTLAVANTTFTVST 335 

nKAkkdLGYePlvtwEEarakTieWiqele< - * 
+KA++++GYePl++wE +r +Ti+W+q+ 
25964 336 DKAQRHFGYEPLFSWEDSRTRTILWVQAAT 365 



Fig. 14B-1 
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Epimerase- domain 1 of 1, from 12 to 365: score -148.0, E = 0.0016 
Epimerase . domain ^ ^ >iLVTGGAGFIGShlvreLlnn . . . ygddkVwLDnLtdyYqyagnea 

+LVTGG GF G h+vr Li+ +++ g +V + + + 

25964 12 YLVTGGCGFLGEHWRMLLQReprLGELRV FD QHLGPW 49 

. rlewegnprytFvkGDIcDrdlldkvfaehqpDaViHfAAeshV.drSi- 

++e + g r+t ++GD+ + + ++a +ViH A++ V +r 
25964 50 LEELKTGPWVTAIQGDVTQAHEVAAAVAGA- -HWIHTAGLVDVf GR- - 95 

e kPlayidtNvvGTltLLEaaRnYWsaLdetkagvkkfvfsSTdeVYGdl 
+Ea+ g +v+ S+ eV G + 



- QTGTRFLVYTS SMEWGPN 136 



P + + Nv GT + 
25964 96 ASPKTIHEVNVQGTRNVIEACV- 

esiPisaF. . .tEdtPynPs. .SPYgaSKassEllvrayhraygLpaiiL 
++ + F ++ EdtPy ++ PY SKa E lv + 
25964 137 TKGHP- -FyrgNEDTPYEAVhrHPYPCSKALAEWLVLEAN 



174 



25964 



25964 



25964 



25964 



25964 



RyFNvYGpyqsgriGedpngfpekLIPliiqnalgkgeplpvYGdDYpTp 
~ ~ + pi + + al p +YG 

LPLV-TCALR PTGIYG- 196 



G+ g+ 
175 -GRKVRGG 



DGtqv.RDw. 



, . . ihVeDharANhllaltkg 

++ + + + ++++++V ++a h+la +++ 



197 ECTQIn^FyrqglrlggwlfraipasvehgrVYVGNVAVM-HVIAMiEL 245 



raGkgsevYNiGg 

++ + + +G 



285 



+++ +++ ++++ ++++++ 
246 eqraalmggqvyfcydgspyrsyedfnmeflgpcglrLVG 

gneysnlEvVealekllgelaPekphvkakedpatf vddRpGddarya . . 
+ + + + +++++1+ 1 + ++ +++ ++ ++a oo „ 

286 - ARPLLP YWLLVFLAALNALLQWL LRPLVttiAPLLN- -PYTLAva 327 

a DasKikreLGWkPevtnleeGladTvnWylene<-* 

+ ++ +++ +K++I G++P + e+ +T+ W + 
328 n 1 1 f t VS TDKAQRHFGYE PLF S -WED S RTRT I LWVQ AAT 365 
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Prodomld 


Start 


End 


Description 


Score 




11 


362 


P 99.2 (39) 3BHS(5) 3BH1(4) 3BH2(3) // DEHYDROGENASE 
STEROID BETA-HYDROXYSTEROID 3BETA-HSD 
DEHYDROGENASE/DELTA 5->4-ISOMERASE INCLUDES: 
PROGESTERONE 3-BETA-HYDROXY-DELTA5-STEROD 
3-BETA-HYDROXY-5-ENE 


395 


View Prodom 1280 Boxer 


| Showing match | T ||Go! 




Prodomld 


Start 


End 


Description 


Score 



View Prodom 1280 |Boxer|T| I Showing match H [GoTj 

>1280 d99 2 (39) 3BHS(5) 3BH1(4) 3BH2(3) // DEHYDROGENASE STEROID 

BETA-HYDROXYSTEROID 3BETA-HSD DEHYDROGENASE/DELTA 5-->4-ISOMERASE 
INCLDD^: PROGESTERONE 3 -BETA- HYDROXY -DELTAS -STEROID 3 - BETA - HYDROXY - 5 - ENE 
Length = 416 

Score = 395 (144.1 bits), Expect = 3.2e-42, , Sum P(2) = 3.2e-42 
Identities = 99/268 (36%) , Positives = 134/268 (50%) 

Ouerv 102 HEVNVQGTRNVIEACVQTGTRFLVYTSSMEWGPNTKGHPFYRGNEDTPYEAVHRHPYPC 161 

* ++ NVOGTRN+IE C RP MEV GPN+ G+E+ +E+ +PYP 

Sbjct: 157 YKFNVQGTRNLIEKC RFF — GVMEVAGPNSYKEIILNGHEEEHHESTWPNPYPY 208 

Ouerv- 162 -SKJUiAEVnjVLFJMGRKVRGGLPLVTCALRPTGIYGEG 220 
yuery. ^ ^ ^ ++ Q L TCALRp iyGEG + + q l+ GG +FR 

Sbjct: 209 YSKKMAEKAVLAAN^SMLKNGGTLYTCALRPMYIYGEGDKFLSPMIVQALKNGGIMFRVG 268 

Query- 221 PASVEHGRVYVGNVAWMHVXXXXXXXXXXXX - -MGGQVYFCYDGSPYRSYEDFNMEFLGP 278 

1 ' VYVGNVAW H+ + GQ Y+ D +P++SY+D N 

Sbjct: 269 GKFSVANPVYVGNVAWAHILAARGLQDPKKSPNIQGQFYYISDDTPHQSYDDLNYTLSKE 328 

Query: 279 CGLRLVGARPLLP- - - -YWXXXXXXXXXXXXXXXXXXXXXXXXXXNP^ 334 

GLRL ++ LP YW N + + b 

Sbjct: 329 WGLRLDSSKWRLPLPLLYWLAFLLEMVSFLLRPISYNYQPPF NRHLVTLSNTTFTFS 385 

Query: 335 TDKAQRHFGYEPLFSWEDSRTRTI LWVQ 362 

KAQR GYEPL SWE+++ +T W++ 
Sbjct: 386 YKKAQRDLGYEPLVSWEEAKQKTSEWIE 413 

Score = 65 (27.9 bits), Expect = 3.2e-42, Sum P(2) = 3.2e-42 
Identities = 11/23 (47%) , Positives = 17/23 (73%) 

Query: 11 VYLVTGGCGFLGEHWRMLLQRE 33 

VY VTGG FLG ++V++L+ + 
Sbjct: 14 VYAVTGGAEFLGRYIVKLLISAD 36 
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Input file Fbh21686Fl.seq; Output File 21686. trans 
Sequence length 1209 

M S L R • 

CCCACGCGTCCGCCCACGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGCCCGCCTCGA ATG TCC CTG AGA 12 

dpraCAOLLWHPAAGMASWA 24 
CCC AGA AGG GCC TGC GCT CAG CTG CTC TGG CAC CCC GCT GCA GGG ATG GCC TCC TGG GCT 72 

yppqYLAPGLLQGQVAIVTG 44 
AAG GGC AGG AGC TAC CTG GCG CCT GGT TTG CTG CAG GGC CAA GTG GCC ATC GTC ACC GGC 132 

riTGTGKAIV KELLELGSNV 64 
GGG GCC ACG GGC ATC GGA AAA GCC ATC GTG AAG GAG CTC CTG GAG CTG GGG AGT AAT GTG 192 

vti<?RKLERLKSAADELQAN 84 
GTC ATT GCA TCC CGT AAG TTG GAG AGA TTG AAG TCT GCG GCA GAT GAA CTG CAG GCC AAC 252 
TDPTKOARVIPIQCNIRNEE104 
CTA CCT CCC ACA AAG CAG GCA CGA GTC ATT CCC ATA CAA TGC AAC ATC CGG AAT GAG GAG 312 
OVNMLVKSTLDTFGKINFLV124 
GAG GTG AAT AAT TTG GTC AAA TCT ACC TTA GAT ACT TTT GGT AAG ATC AAT TTC TTG GTG 372 
MNfCGOFLSPAEHISSKGWH 144 
AAC AAT GGA GGA GGC CAG TTT CTT TCC CCT GCT GAA CAC ATC AGT TCT AAG GGA TGG CAC 432 
» w T RTNLTGTFYMCKAVYSS164 
GCT GTG CTT GAG ACC AAC CTG ACG GGT ACC TTC TAC ATG TGC AAA GCA GTT TAC AGC TCC 492 
W Miri?HGGSIVNIIVPTKAGFl84 
TGG ATG AAA GAG CAT GGA GGA TCT ATC GTC AAT ATC ATT GTC CCT ACT AAA GCT GGA TTT 552 

D i iVHSGAARAGVY _N L T K S L 204 
CCA TTA GCT GTG CAT TCT GGA GCT GCA AGA GCA GGT GTT TAC AAC CTC ACC AAA TCT TTA 612 

a TEWACSGIRINCVAPGVIY224 
GCT TTG GAA TGG GCC TGC AGT GGA ATA CGG ATC AAT TGT GTT GCC CCT GGA GTT ATT TAT 672 

QOTAVENYGSWGQSFFEGSF244 
TCC CAG ACT GCT GTG GAG AAC TAT GGT TCC TGG GGA CAA AGC TTC TTT GAA GGG TCT TTT 732 

niTTPAKRIGVPEEVSSVVCF264 
CAG_AAA ATC CCC GCT AAA CGA ATT GGT GTT CCT GAG GAG GTC TCC TCT GTG GTC TGC TTC 792 

tt opAASFITGQSVDVDGGR284 
CTA CTG TCT CCT GCA GCT TCC TTC ATC ACT GGA CAG TCG GTG GAT GTG GAT GGG GGC CGG 852 

qt YTHSYEVPDHDNWPKGAG 304 
AGT CTC TAT ACT CAC TCG TAT GAG GTA CCA GAT CAT GAC AAC TGG CCC AAG GGA GCA GGG 912 

nT cvVKKMKETLKEKAKL* 323 
GAC CTT TCT GTT GTC AAA AAG ATG AAG GAG ACC TTA AAG GAG AAA GCT AAG CTC TGA 969 
GCTGAGGAAACAAGGTGTCCTCCATCCCCAGTGCCTTCACATCTTGAGGATATGCTTCTGTACTTTTTAAAAGCTTATA 
GTTGGTATGGAAAACATTTTTCTTATTTTTAAGTGTTATTAATTATATCTATGGAAAAACTATTCCTGAAATATATACA 

GTCTTATGTCCCAAAAAAAAAA FiQ 16 
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CLUSTAL W (1.74) multiple sequence alignment 



5052204 SDR rat ----- - MGSWKSGQS YLAAGLLQNQVAWTGGATGI GKAI SRELLHL 

21686 " " MSLRPRRACAQLLWHPAAGMASWAKGRSYLAPGLLQGQVAIVTGGATGIGKAIVKELLEL 

*** *.****,****,***.************ .***,* 

5052204 SDR rat GCNWIASRKLDRLTAAVDELRASQPPSSSTQVTAIQCNIRKEEEVHNLVKSTLAKYGKI 
21686 ~ ~ GSNWIASRKLERLKSAADELQANLPPTKQARVIPIQCNIRNEEEVNNLVKSTLDTFGKI 

*,*********:**. :*.***:*. **:..::* .******;************ .:*** 

5052204 SDR rat NFLVNNAGGQFMAPAEDITAKGWQAVIETNLTGTFYMCKAVYNSWMKDHGGSIVNIIVLL 
21686 " ~~ NFLVNNGGGQFLSPAEHISSKGWHAVLETNLTGTFYMCKAVYSSWMKEHGGSIVNIIVPT 

****** ****..***,*;.***:**:*************** ,**** : ********** 

5052204 SDR rat NNGFPTAAHSGAARAGVYNLTKTMALTWASSGVRINCVAPGTIYSQTAVDNYGELGQTMF 
91686 " ~ KAGFPLAVHSGAARAGVYNLTKSLALEWACSGIRINCVAPGVIYSQTAVENYGSWGQSFF 

. *** * **************..** **.**;********.*******:***. **::* 

« • • • 

5052204 SDR rat EMAFENIPAKRVGLPEEISPLVCFLLSPAASFITGQLINVDGGQALYTRNFTIPDHDNWP 
21686 ~ ~~ EGSFQKIPAKRIGVPEEVSSWCFLLSPAASFITGQSVDVDGGRSLYTHSYEVPDHDNWP 

* .*..*****.*.***;*. ;**************-* ::****::***:.: :******* 

5052204 SDR rat VGAGDSSFIKKVKESLKKQARL 
21686 ~ " KGAGDLSWKKMKETLKEKAKL 

**** *.**.**;**;;*:* 
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r 

Signal Peptide Predictions for 21686 



Method 


Predict |Score|[Mat@ 


SigaaUeukaryote) 


MAYBE \ || 2:0 



Note: ami no-terminal 70aa used for signal peptide prediction 



Transmembrane Segments Predicted by MEMSAT 



Start I End 


Orient || Score 


29 |50 


ins~>out|| 0.9 


170 1 188 


out— >ins]| 0.2 


208J224 


ins— >outj| 0.6 


258 |275 


out— >ins|| 2.6 



>21686 

MSLRPRRACAQLLWHPAAGMASWAKGRAYLAPGLLQGQVAIVTGGATGIGKAIVKELLEL 
GSNWIASRKLERLKSAADELQANLPPTKQARVIPIQCNIRNEEEVNNLVKSTLDTFGKI 
NFLVNNGGGQFLSPAEHISSKGWHAVLETNLTGTFYMCKAVYSSWMKEHGGSIVNIIVPT 
KAGFPLAVHSGAARAGVYNLTKSLALEWACSGIRINCVAPGVIYSQTAVENYGSWGQSFF 
EGSFQKIPAKRIGVPEEVSSWCFLLSPAASFITGQSVDVDGGRSLYTHSYEVPDHDNWP 
KGAGDLSWKKMKETLKEKAKL 



Transmembrane Segments for Presumed Mature Peptide 



Start ||End 


Orient || Score 


10 J31 


ins~>out 0.9 


151 | 169 


out->ins|0.2 


189 |205 


ins— >out|| 0.6 


239 f256 


out— >ins|| 2.6 



>2 1 6 8 6_mature 

MASWAKGRSYLAPGLLQGQVAIVTGGATGIGKAIVKELLELGSNWIASRKLERLKSAAD 
ELQANLPPTKQARVIPIQCNIRNEEEVNNLVKSTLDTFGKINFLVNNGGGQFLSPAEHIS 
SKGWHAVLETNLTGTFYMCKAVYSSWMKEHGGSIVNIIVPTKAGFPLAVHSGAARAGNYN 
LTKSLALEWACSGIRINCVAPGVIYSQTAVENYGSWGQSFFEGSFQKIPAKRIGVPEEVS 
SWCFLLSPAASFITGQSVDVDGGRSLYTHSYEVPDHDNWPKGAGDLSWKKMKETLKEK 
AKL 
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Protein Family / Domain Matches, HMMer Version 2 

Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

-HMMER 2.1.1 (Dec 1998) ~ ' ' . 

Copyright (C) 1992-1998 Washington University School of Medicine 

HMMER is freely distributed under the GNU General Public License (GPL) . 

HMM file: /prod/ddm/seqanal/PFAM/pf am4 .4/Pfam 

Sequence file: /prod/ddm/wspace/orfanal/oa- script. 19160 .seq 

Query: 21686 

Scores for sequence family classification (score includes all domains) : 
Model Description Score E-value N 



adh short short chain dehydrogenase 162.5 ^-^ e "^ } 

adh short C2 short chain dehydrogenase /reductase C-te 47.2 3.7e-10 l 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



adh short 1/1 38 226.. 1 203 t] 162.5 7.3e-45 

adlTshort_C2 1/1 250 280 .. 1 31 [] 47.2 3.7e-10 

Alignments of top-scoring domains: 

adh short: domain 1 of 1, from 38 to 226: score 162.5, E = 7-3e-45 

* - >KvaLvTGassGIGlaiAkrLakeGakVvvadrneeklekGavakelk 

+va+vTG++ GIG+ai+k+L++ G +Vv+a r e+1 ++++ 
21686 38 QVAIVTGGATGIGKAIVKELLELGSNWIASRKLERL KSAAD 79 

elGgnd kdralaiqlDvtdeesv . aaveqavcrlGrlDvLVNNAGg 

el +n+++++ r+++iq++++ ee+v+++v+ ++ +G+++ LVNN Gg 
21686 80 ELQANLpptkQARVIPIQCNIRNEEEVnNLVKSTLD'i'FGKINFLVNNGGG 129 

. i i llrpgp f aelsr tmeedwdrvidvNl tgvf 1 1 travlplmamkkrgg 
+++ p++ +s + w +v+++Nltg+f++++av +k +g 
21686 130 qFL SPAEHIS — SKGWHAVLETNLTGTFYMCKAVYS - - SWMKEHG 170 

GrlvNiSSvaGrkegglvgvpggsaYsASKaAvigltrsLAlElaphglr 
G+ivNi + g+p ++ +A+ a+v lt+sLAlE+a glr 

21686 171 GSIVNIIV-PT KAGFPLAVHSGAARAGVYNLTKSLALEWACSGIR 214 

VnavaPGgvdTd<-* 
+n+vaPG ++ + 
21686 215 INCVAPGVIYSQ 226 

adh short C2: domain 1 of 1, from 250 to 280: score 47.2, E = 3.7e-10 

* - >gRlGePeEiAnawFLASdaAsYiTGqtlvV< - * 

+R G PeE++++v FL S+aAs+iTGq + V 
21686 250 KRIGVPEEVSSWCFLLSPAASFITGQSVDV 280 
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Prodomld 


Start 


End 


Description 


Score 


View Prodom 1216221 Boxer !▼ 


29 


82 


p99.2 ( 1 ) YS05 CAEEL // HYPOTHETICAL 98.0 KD 
PROTEIN F56D1.5 IN CHROMOSOME II 
TRANSMEMBRANE 


70 


| Showing match M Go! 


View rroaom Vjjui doacl i i 


35 


82 


p99.2 (1)027957 ARCFU // SHIKIMATE 
5-DEHYDROGENASE AROE HYPOTHETICAL 
PROTEIN 


86 


| Showing match !▼ |Go!| 


View Prodom 1 1 Boxer | ▼ | 


37 


231 


p99.2 (1078) ADH(34) GALE(20) FABG(13) // 
OXIDOREDUCTASE PROTEIN DEHYDROGENASE 
MAD RFni JfTASF NADP BIOSYNTHESIS 
SYNTHASE ALCOHOL PUTATIVE 


157 


1 ^hruA/infr mntrh !▼ IGo'I 

I OlivJVYlIle^ llldL^ll 1 |v_»vy .j 




View Prodom 737531 Boxer M 


237 


286 


p99.2(l)P71079 BACSU // UNIDENTFIED 
DEHYDROGENASE 


84 


| Showing match | T ||Go!j 


Vie.w Prodom 772231 Boxer T | 


243 


287 


p992 (1)007882 STAXY// 
GLUCOSE- 1-DEHYDROGENASE 


92 


Showing match " HlGo!| 


Prodomld 


Start 


End 


Description 


Score 



View Prodom 11 | Boxer \r\ \ Showing match !▼[ jGo 1| 

^11 nQQ 9 M078) ADH(34) GALE (20) FABG ( 13 ) // OXIDOREDUCTASE PROTEIN 

DEHYDROGENASE^AD REDUCTASE NADP BIOSYNTHESIA SYNTHASE ALCOHOL PUTATIVE 
Length =269 



Score = 157 (60-3 bits), Expect = L2e-09, P = l;2e-09 
Identities = 64/213 (~0%) , Positives = 106/213 (49%) 

51 KAIVKELLELGSNWIASRKLERLKSAADELQANLPPTKQA RVIPIQCNIRNEEEVN 107 

K +V S AS+ E ++A+TQA V + C++ + E+V 

35 KVWVSATSESGESTEASK- - ESAMEVSKAWAEVSATMQAVGVTVTKVTCDVADVEDVE 92 

108 NLVKSTLDTF -™ - GKINFLVNNGGGQFLSP AEHI S SKG WHAVLETNLTGTF 155 

LV++ ++ F GKI+ LVNN G ++P AE ++ + W V+E N+TGTF 

93 KLVETWEEFSGIHGKIDVLVNNAG- -VMAPKAVAESMTEETSDDEEWEEVIEVNVTGTF 150 

EHGGSIVNI - - IVPTKAGFP - - LAVHS GAARAGVYNLTKS 203 
G+IVN+ + + G P A +S A++A V + TKS 



Query: 
Sbjct: 
Query : 
Sbjct: 
Query: 
Sbjct: 
Query : 
Sbjct: 



156 YMCKAVYS SWMK 

A . K G+IVN+ + + G P A +S A++A V + TKS 

151 NLTQAALPAMKKFSDAAAKKRFVGTIVNVASVAGSTMGSPGSQAAYS-ASKAAVESFTKS 209 

204 LALE WACSG- -IRINCVAPGVIYSQTAVEN 231 

LA+E ++ S +R+N VAPG + + A+E+ 
210 LAMELSPYSASVAMVRVNAVAPGYVETD-ALES 241 

Score = 103 (41.3 bits), Expect = 0.0021, Sum P(2) = 0.0021 
Identities = 32/100 (32%), Positives = 54/100 (54%) 

Ouerv- 37 GQVAIVTGGA- -TGIGKAIVKELLELGSNWIASRKLERLKS- -AADE LQAN 84 

G+ +VTGG+ +GIG AI ++L E G+ W+ S E +S A+ E + A 

Sbjct: 7 GKTVLVTGGSGFSGIGLAIARQLAEEGAKVVVVSATSEESESTEASKESAMEVSKAVNAE 66 

Ouerv- 85 LPPTKQA RVIPIQCNIRNEEEVNNLVKSTLDTFGKIN 121 

V y " + T QA V + C++ + E+V LV++ ++ F 1+ 

Sbjct: 67 VS ATMQ AVGVTVTKVTCDVADVEDVEKLVETVVEEF S G I H 106 

Score = 37 (18.1 bits), Expect= 0.0021, Sum P(2) = 0.0021 
Identities = 9/23 (39%), Positives = 13/23 (56%) 

Query: 205 ALEWACSGIRINCVAPGVIYSQT 227 

ALE A +G+ + V PG + T 
Sbjct: 238 ALES ATNGL S WTVR P GNVRVNT 260 
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View Prodom 77223 I Boxer M | Showing match \r\ [Go 



>77223 p99.2 (1) O07882STAXY // GLUCOSE- 1 -DEHYDROGENASE 
Length =67 

-Score = 92 (37.4 bits), Expect =0.00031, P= 0.00031 
Identities = 19/45 (42%), Positives = 29/45 (64%) 

Ouerv: 243 SFQKIPAKRIGVPEEVSSWCFLLSPAASFITGQSVDVDGGRSLY 287 
W y + i IPAK IG ++V++V FL S A +1 G ++ VDGG + Y 

Sbjct- 15 TLEMIPAKEIGFADQVANVARFLCSDLADYIHGTTIYVDGGMTNY 59 



View Prodom 95301 1 Boxer |*| | Showing match \r\ [Go 



>95301 p99.2 (1) 027957ARCFU // SHIKIMATE 5 -DEHYDROGENASE 
AROE HYPOTHETICAL 

PROTEIN 

Length =108 

Score = 86 (35.3 bits), Expect = 0.0014, P= 0.0014 
Identities = 20/48 (41%), Positives = 31/48 (64%) 

Ouerv 35 LQGQVAIVTGGATGIGKAIVKELLELGSNWIASRKLERLKSAADELQ 82 

L G+ A+V G A G GKA LL++GS V++A+R E+ + A + L+ 
Sbjct- 10 LGGKTALWG-AGGAGKAAALALLDMGSTVIVANRTEEKGREAVEMLR 56 



Boxer ▼ Showing match 



Go! 



View Prodom 73753 

>73753 p99.2 (1) P71079_BACSU // UNIDENTFIED DEHYDROGENASE 
Length = 60 " 

Score = 84 (34.6 bits), Expect = 0.0023, P =0.0023 
Identities = 20/50 (40%), Positives = 29/50 (58%) 

Ouerv 237 QSFFEGSFQKIPAKRIGVPEEVSSWCFLLSPAASFITGQSVD VDGGRSL 286 
W Y * + E + Q PA R+ +++ V FL+S A I GQ++ VDGGRSL 

Sbjct- 9 EDLLEDARQNTPAGRMVEIKDMVDTVEFLVSSKADMIRGQTIIVDGGRSL 58 



View Prodom 121622 I Boxer |t] 1 Showing match |t| [Gol] 



>121622 p99.2 (1) YS05_CAEEL // HYPOTHETICAL 98.0 KD PROTEIN F56D1.5 IN 
CHROMOSOME II TRANSMEMBRANE 
Length = 194 

Score = 70 (29.7 bits), Expect = 7.6, P = 1.0 
Identities = 20/57 (35%), Positives = 29/57 (50%) 

Ouerv 29 YLAPG; ; QGQV- -AIVTGGATGIGKAIVKELLELG-SNWIASRKLERLKSAADELQ 82 
Query. +V+GG GIGKA EL + G V+ R ++L S E++ 

Sbjct: 62 FYKPNLEQYQHRWTWSGGTDGIGKAYTLELAKRGLRKFVLIGRNPKKLDSVKSEIE 118 

Fig. 21 B 
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GGAATGGATGCTGTTGGCTTAAACCTCCCCCTGCCCTGGGGGTTGCAACCAGGGTCTCTG 
CAAAGCCAATCCTTTGTCATCCCGCTGTCCTGCAGAGCAA GATGGGGCTCATGGCTGTCC 
TGATGCTACCCCTGCTGCTGCTGGGAATCAGCGGCCTCCTCTTCATTTACCAGGAGGCAT 
CCAGGCTGTGGTCGAAGTCTGCCGTGCAGAACAAAGTGGTGGTCATCACAGATGCCATCT 
CAGGACTGGGAAAGGAGTGTGCTCGGGTGTTCCATGCAGGTGGGGCAAGGCTGGTGCTGT 
GTGGAAAGAACTGGGAGGGACTGGAGAGCCTCTATGCCACCTTGACCAGTGTGGCTGACC 
CCAGCAAGACATTCACCCCCAAGCTGGTCCTCCTGGATCTCTCAGACATTAGCTGTGTTC 
AAGATGTGGCCAAAGAGGTCCTGGACTGCTACGGCTGTGTGGACATCCTCATCAACAATG 
CCAGCGTGAAAGTGAAGGGGCCTGCCCACAAGATTTCCCTGGAGCTTGACAAAAAGATCA 
TGGATGCCAACTACTTCGGACCCATCACTTTAACCAAAGTTCTGCTTCCCAACATGATCT 
CCAGGAGAACAGGCCAGATTGTGTTAGTGAACAACATCCAAGCGAAGTTTGGAATCCCGT 
TCCGCACAGCTTATGCAGCCTCTAAGCATGCCGTCATGGGCTTCTTTGACTGCCTCCGAG 
CCGAGGTTGAGGAATACGATGTTGTGGTCAGCACCGTGAGCCCAACTTTCATCCGCTCCT 
ACCGTGCTTCCCCTGAGCAAAGAAACTGGGAGACATCCATTTGTAAATTCTTCTGCAGGA 
AGCTAGCCTATGGCGTGCACCCGGTGGAGGTGGCTGAGGAAGTGATGCGCACAGTACGGA 
GGAAGAAGCAAGAGGTGTTCATGGCCAACCCGGTTCCTAAGGCTGCCGTGTTCATCCGCA 
CCTTCTTCCCTGAGTTCTTCTTCGCTGTGGTGGCCTGTGGGGTGAAGGAGAAGCTCAATG 
TCCCAGAAGAGGGT TAACCTCGTGGCCAAAGGGGTCACTCAAGGGGAATAAAGGCTTTCC 
TAGAGAAAAAAAAAAAAAAAAAAAAAAA 



Fig. 31 A 
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MGLMAVLMLPLLLLGI SGLLFI YQEASRLWSKSAVQNKWVITDAI SGLGKECARVFHAG^ 

GARLVLCGKNWEGLESLYATLTSVADPSKTFTPKLVLLDLSDISCVQDVAKEVLDCYGCV 

DILINNASVKVKGPAHKISLELDKKIMDANYFGPITLTKVLLPNMISRRTGQIVLVNNIQ 

AKFGIPFRTAYAASKHAVMGFFDCLRAEVEEYDWVSTVSPTFIRSYRASPEQRNWETSI 

CKFFCRKLAYGVHPVEVAEEVMRTVRRKKQEVFMANPVPKAAVFIRTFFPEFFFAWACG 

VKEKLNVP E EG . 



Fig. 31 B 
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GAP of: FrGcgManager_31_UFAHDJyG_ check: 516 from: 1 to: 936 
M21481 ORF - Import - vector trimmed 

to: FrGcgManager_31_VFA0zr_19 check: 2871 from: 1 to: 933 
h21481 ORF - Import - vector trimmed 

Symbol comparison table: /ddm_local/gcg/gcg_9 .1/gcgcore/data/rundata/ 
nwsgapdna . cmp 
CompCheck: 8760 

Gap Weight: 12 Average Match: 10.000 

Length Weight: 4 Average Mismatch: 0.000 

Quality: 8220 Length: 936 

Ratio: 8.810 Gaps: 0 

Percent Similarity: 88.103 Percent Identity: 88.103 

Match display thresholds for the alignment (s) : 
| = IDENTITY 
: = 5 
. = 1 

FrGcgManager_31_UFAHDJyG_ x FrGcgManager_31_VFA0zr_19 

1 ATGGGGCTCATGGCTGTCCTGATGCTACCCCTGCTGCTGCTGGGAATCAG 50 

iiiii iiiiiii i iiiiiim in iniiiiiiiiiiiiiiii qn 

1 atgggagtcatggccatgctgatgctccccctgctgctgctgggaatcag 50 

• • * 

51 CGGCCTCCTCTTCATTTACCAGGAGGCATCCAGGCTGTGGTCGAAGTCTG 100 

IIIIIIIIIIIMIIMIHI Mil IIIIIIIIIMIII Mill I 

51 iggcctcctcttcatttaccaagaggtgtccaggctgtggtcaaagtcag 100 
101 CCGTGCAGAACAAAGTGGTGGTCATCACAGATGCCATCTCAGGACTGGGA 150 

I M I II 1 1 II II 1 1 M M I II Mill MMIIMIMMIMIMI 

101 ctgtgcagaacaaagtggtggtgatcaccgatgccatctcaggactgggc 150 
151 AAGGAGTGTGCTCGGGTGTTCCATGCAGGTGGGGCAAGGCTGGTGCTGTG 200 

Mil Ml MUIIMMMII M M I MM II II II 1 1 III 1 1 1 1 IN 200 

151 aaggagtgtgctcgggtgttccacacaggtggggcaaggctggtgctgtg 200 
201 TGGAAAGAACTGGGAGGGACTGGAGAGCCTCTATGCCACCTTGACCAGTG 250 

II II MM III IIIII I II MM Ml MM I II II I Ml I 

201 tggaaagaactgggagaggctagagaacctatatgatgccttgatcagcg 250 
251 TGGCTGACCCCAGCMGACATTCACCCCCMGCTGGTCCTCCTGGATCTC 300 

1 1 1 II II II II II II II I II II II II II II II II II III MM HI , no 

251 tggctgaccccagcaagacattcaccccaaagctggtcctgttggacctc 300 

Fig. 32A 
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301 


TCAGACATTAGCTGTGTTCAAGATGTGGCCAAAGAGGTCCTGGACTGCTA 


350 


301 


IMIIIM MINIM 1 MINIMI INN IIIINN Mill 

tcagacatcagctgtgtcccagatgtggcaaaagaagtcctggattgcta 


350 


351 


• • • • • 
CGGCTGTGTGGACATCCTCATCAACAATGCCAGCGTGAAAGTGAAGGGGC 


400 


351 


MIMMNNIMNIIIIMMMININ INN MINIMI! 

tggctgtgtggacatcctcatcaacaatgccagtgtgaaggtgaaggggc 


400 


401 


• • • • « 
CTGCCCACAAGATTTCCCTGGAGCTTGACAAAAAGATCATGGATGCCAAC 


450 


401 


MINN NINNI IIIINN IIIIIININIIIIIINIIII 

ctgcccataagatttctctggagctcgacaaaaagatcatggatgccaat 


450 


451 


• • • • « 
TACTTCGGACCCATCACTTTAACCAAAGTTCTGCTTCCCAACATGATCTC 


500 


451 


INN II IIIINN II II INI NNNNNNININII 

tactttggccccatcacattgacgaaagccctgcttcccaacatgatctc 


500 


501 


• ♦ • ♦ « 
CAGGAGAACAGGCCAGATTGTGTTAGTGAACAACATCCAAGCGAAGTTTG 


550 


501 


i iNiiiiiiiiii ii iiiiiiiiiii ii mini NNNN 

ccggagaacaggccaaatcgtgttagtgaataatatccaagggaagtttg 


550 


551 


• • • • • 

GAATCCCGTTCCGCACAGCTTATGCAGCCTCTAAGCATGCCGTCATGGGC 


600 


551 


NNNNINN 11 Nil II INN INN II 1 1 INN 

gaatcccgttccgtacgacttacgctgcctccaagcacgcagccctgggc 


600 


601 


• • ♦ • • 
TTCTTTGACTGCCTCCGAGCCGAGGTTGAGGAATACGATGTTGTGGTCAG 


650 


601 


NINNIININNNINN II limillNIIIIIN INI 

ttctttgactgcctccgagccgaagtggaggaatacgatgttgtcatcag 


650 


651 


• • • • « 
CACCGTGAGCCCAACTTTCATCCGCTCCTACCGTGCTTCCCCTGAGCAAA 


700 


651 


illinium iiiiiiiiiii n mi 1 i n mm 

caccgtgagcccgactttcatccggtcgtaccacgtgtatccagagcaag 


700 


701 


« • • • # 
GAAACTGGGAGACATCCATTTGTAAATTCTTCTGCAGGAAGCTAGCCTAT 


750 


701 


NIIININ 1 NNNN NINNI 1 IMIIIM! Mil 

gaaactgggaagcttccatttggaaattctttttcaggaagctgacctac 


750 


751 


• • • * • 

GGCGTGCACCCGGTGGAGGTGGCTGAGGAAGTGATGCGCACAGTACGGAG 


800 


751 


IIIIIIIIIII N NINNI INN IIIIIIIIIII II Mill 

ggcgtgcacccagtagaggtggcggaggaggtgatgcgcaccgtgcggag 


800 


801 


• • • • • 

GAAGAAGCAAGAGGTGTTCATGGCCAACCCGGTTCCTAAGGCTGCCGTGT 


850 


801 


iiiniiiiiiiniiii iiiiiiiiiii i ii inn mini 

gaagaagcaagaggtgtttatggccaaccccatccccaaggccgccgtgt 


850 




• • • • * 
TCA x LLGCACCTTCTTCCCTGAGTTCTTCTTCGCTGTGGTGGCCT 


AAA 

900 


851 


1 IIIINIIIIIIIIII IIIINN INN I II III 1 1 III III 1 

acgtccgcaccttcttcccggagttctttttcgccgtggtggcctgtggg 


900 


901 


• « • 
GTGAAGGAGAAGCTC AATGTCCCAGAAGAGGGTTAA 936 




901 


IIIIIIIIIIIINIINIIIII II INN 

gtgaaggagaagctcaatgtcccggaggagggg. . , 933 






Fig. 32B 
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GAP of: FrGcgManager_32_ZFA004eiD check: 657 from: 1 to: 311 
m21481 aa - Import - complete 

to: FrGcgManager_32_AGAjaPna_ check: 9949 from: 1 to: 311 
h214 81 aa - Import - complete 

Symbol comparison table: /prod/ddm/seqanal/BLAST/matrix/aa/BLOSUM62 

CompCheck: 1102 
Matrix made by matblas from blosum62 . 

Gap Weight: 12 Average Match: 2.778 

Length Weight: 4 Average Mismatch: -2.248 

Quality: 1467 Length: 311 

Ratio: 4.717 Gaps: 0 

Percent Similarity: 92.926 Percent Identity: 91.318 

Match display thresholds for the alignment (s) : 
| = IDENTITY 
: = 2 
• = 1 

FrGcgManager_32_ZFA004eiD x FrGcgManager_32_AGAj aPna_ .. 
1 MGLMAVLM^rLLLLGISGLLFIYQEASRLWSKSAVQ 50 

. • • 

51 KECARVFHAGGARLVLCGKNWEGLESLYATLTSV^^ 100 

101 SDISCVQDVAKEVLDCYGCVDILINNASTO^ 150 
151 YFGPITLTKVLLPNMISRRTGQIVLVNNIQAKFGIPFRTAYAASK^VMG 200 

• • * 

201 FFDCLRAEVEEYDVWSTVSPTFIRSYRASPEQRNWETSICKFFCRKLAY 250 

• • * 

251 GVHPVEVAEEVMRTVRRKKQEVFMANPVPKAAVFIRTFFPEFFFAVyACG 300 

251 UUUiiiU^UiUiiUUUiiiUiUUiil-UUUUiii 30 o 

301 VKEKLNVPEEG 311 

301 VKEKLNVPEEG 311 Pjg. 33 



